Research and Realization about Conversion Algorithm of PDF Format into PS Format

Authors

  • Xingfu Wang
  • Lei Qian
  • Fuyou Mao
  • Zhaosheng Zhu
Abstract

This paper first introduces the characteristics of PostScript and PDF documents and, on that basis, argues for the necessity and feasibility of converting the PDF document format into a PostScript language program. It then studies the main algorithms and techniques of the conversion process and implements information extraction from PDF documents. Finally, it implements a software algorithm for converting the PDF document format into PS format on the basis of the ...

Similar resources

PDF2XML: Converting PDF to XML

XML is a markup language for documents containing structured information. It is designed to make it easy to interchange structured documents over the Internet and to integrate them with database management systems. PDF is a document format intended to electronically reproduce the look of a page. There is a huge demand for converting existing PDF documents into XML documents, so that they wil...

From Legacy Documents to XML: A Conversion Framework

We present an integrated framework for document conversion from legacy formats to XML. We describe the LegDoC project, aimed at automating the conversion of layout annotations in layout-oriented formats like PDF, PS and HTML to semantic-oriented annotations. A toolkit of different components covers complementary techniques: the logical document analysis and semantic annotations with the ...

Reverse Engineering of Network Software Binary Codes for Identification of Syntax and Semantics of Protocol Messages

Reverse engineering of network applications, especially from the security point of view, is of high importance and interest. Many network applications use proprietary protocols whose specifications are not publicly available. Reverse engineering of such applications could provide us with vital information to understand their embedded unknown protocols. This could facilitate many tasks including d...

Presentable Document Format: Improved On-demand PDF to HTML Conversion

Search engines such as Google and MSN Search crawl and index files in Adobe’s Portable Document Format (PDF) alongside material in HTML. Google furthermore offers a View as HTML option for PDF that includes query term highlighting. The visual appearance of these HTML files converted from PDF is very poor. In this paper we claim that significant improvements to the quality of on-demand PDF to HT...

Conversion of TEX fonts into Type 1 format

This paper analyses the problem of converting TeX fonts to Type 1 fonts, describes TeXtrace, a new free conversion program, and compares it to other possible methods and existing utilities. TeXtrace works by rendering the font in high resolution and then tracing (vectorizing) it. Keywords: PDF, font conversion, Type1, METAFONT, vector, outline, raster, bitmap,

Publication date: 2010